Using Graphs for Shallow Question Answering on Legal Documents

نویسندگان

  • Alfredo Monroy
  • Hiram Calvo
  • Alexander F. Gelbukh
چکیده

This work describes a Shallow Question Answering System (QAS) restricted to legal documents. This system returns a set of relevant articles extracted from several regulation documents. The set of relevant articles allows inferring answers to questions posed in natural language. We take the approach of representing the set of all the articles as a graph; the question is split in two parts (called A and B), and each of them is added as part of the graph. Then several paths are constructed from part A of the question to part B, so that the shortest path contains the relevant articles to the question. We evaluate our method comparing the answers given by a traditional information retrieval system—vector space model adjusted for article retrieval, instead of document retrieval—and the answers to 21 questions given manually by the general lawyer of the National Polytechnic Institute, based on 26 different regulations (academy regulation, scholarships regulation, postgraduate studies regulation, etc.); with the answer of our system based on the same set of regulations. The results show that our system performs twice as better with regard to the traditional Information Retrieval model for Question Answering.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automatic Identification of Concepts and Conceptual relations from Pa- tents Using Machine Learning Methods

This paper presents a machine learning approach to automatically extract concepts and the conceptual relations towards creation of Conceptual Graphs (CGs) from patent documents using shallow parser and NER. The main challenge in the creation of conceptual graphs from the natural language texts is the automatic identification of concepts and conceptual relations. The texts analyzed in this work ...

متن کامل

Patent Document Summarization Using Conceptual Graphs

In this paper a methodology to mine the concepts from documents and use these concepts to generate an objective summary of the claims section of the patent documents is proposed. Conceptual Graph (CG) formalism as proposed by Sowa (Sowa 1984) is used in this work for representing the concepts and their relationships. Automatic identification of concepts and conceptual relations from text docume...

متن کامل

NLP for Shallow Question Answering of Legal Documents Using Graphs

Previous work has shown that modeling relationships between articles of a regulation as vertices of a graph network works twice as better than traditional information retrieval systems for returning articles relevant to the question. In this work we experiment by using natural language techniques such as lemmatizing and using manual and automatic thesauri for improving question based document r...

متن کامل

Efficient Question Answering with Question Decomposition and Multiple Answer Streams

The German question answering (QA) system IRSAW (formerly: InSicht) participated in QA@CLEF for the fifth time. IRSAW was introduced in 2007 by integrating the deep answer producer InSicht, several shallow answer producers, and a logical validator. InSicht builds on a deep QA approach: it transforms documents to semantic representations using a parser, draws inferences on semantic representatio...

متن کامل

University of Hagen at QA@CLEF 2008: Efficient Question Answering with Question Decomposition and Multiple Answer Streams

The German question answering (QA) system IRSAW (formerly: InSicht) participated in QA@CLEF for the fifth time. IRSAW was introduced in 2007, by integrating the deep answer producer InSicht, several shallow answer producers, and a logical validator. InSicht realizes a deep QA approach: it transforms documents to semantic representations using a parser, draws inferences on semantic representatio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008